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(57) Abstract 

Microsatellite markers arc provided which are useful in identifying linked markers for canine genetic diseases and traits. The 
microsatellite markers are derived from regions of genomic DNA which contain a repeat motif, flanked by unique sequences. The number 
of units contained within the repeat motif is variable, such that various different alleles are present in any given population. The microsatellite 
markers and their progeny are especially useful in detecting genetic diseases not phenotypically visible and identifying carriers of recessive 
diseases, as illustrated in the figure. In a preferred embodiment, microsatellite markers are provided which may be used to detect the canine 
copper toxicosis gene. 
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MICROSATELLITE MARKERS FOR IDENTIFYING 
CANINE GENETIC DISEASES OR TRAITS 

FIELD OF THE INVENTION 

This invention relates generally to genetic markers and methods of making 
5 and using such markers, and more particularly, to a microsatellite marker that may 
be used to detect copper toxicosis in canines. 

BACKGROUND OF THE INVENTION 
Due to inbreeding and the relatively shallow gene pool, a large number of 
genetic diseases are present in dogs (Clark, R.D. et al., Medical and Genetic 
0 Aspects of Purebred Dogs (Forum Publications, Fairway, KS) (1994) and Robinson, 
R„ Canine Pract 16:29-34 (1991)). Some of these genetic diseases such as copper 
toxicosis in the Bedlington terrier breed, are so prevalent in a particular breed that 
the mutant allele frequency may be higher than that of the normal allele (Heritage, 
M.E. etal., J. Small Anim. 28:1141-1151 (1987); and Yuzbasiyan-Gurkan. V. etai., 
5 Genomics 15:86-90 (1993)). Other genetic diseases cross many breeds, as 
exemplified by progressive retinal atrophy causing blindness (Barnett, K.C., Adv. Vet. 
ScL Comp. Med. 20:9-67 (1976)) and hip dysplasia resulting in painful and crippling 
arthritis (Corley, E.A., Small Anim. Pract 22:570-593 (1992)). 

Canine copper toxicosis (CT) is an autosomal recessive genetic disorder of 
copper accumulation which results in severe liver damage. Unless specific' anti- 
copper treatment is instituted, affected dogs die by three to seven years of age. 
While reported in several breeds, it is best characterized in Bedlington terriers, with 
the frequency of the defective gene being estimated at 50%. The disease is also 
prevalent in the West Highland White Terrier and Keeshond. 

Currently, the only method for diagnosing affected CT dogs is by a 
quantitative liver copper assay in a liver biopsy sample, after one year of age. 
Unfortunately, heterozygous and homozygous normal animals are indistinguishable 
from each other by this test. In order to determine if a dog is a heterozygous carrier, 
test-breeding strategies must be employed which require that there be a dog of a 
known genotype to breed against the potential carrier. This process is very costly 
and results in the birth of many affected individuals. It is therefore impractical for 
breeders to identify breeding stock free of the gene and currently carriers of the 
gene are only identified after they are found to be the parents of an affected dog. 

Because like CT, many of the canine genetic diseases are recessive, various 
methods have been investigated which would identify, on a molecular level, 
phenotypically normal carriers. One method that has been employed is the whole 
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gene subtraction method. This approach requires the sorting out of differences 
between DNA from those with or without the disease or trait with molecular 
manipulation methods. Unfortunately, this technique is somewhat impractical and 
requires that ali variability within individuals with the trait as well as the variability 
5 within those without the trait independent of the trait, be differentiable from the one 
or few that are dependent on the trait. Furthermore, this method has only been 
demonstrated on very simple organisms such as yeast, and while this technique 
appears theoretically possible for higher species, it rapidly becomes impractical, as 
it requires many breeding studies of large numbers of affected animals. 
10 An alternative method, the use of restriction fragment length polymorphisms 

(RFLP), is extremely labor intensive and expensive with respect to both 
characterization and analysis. Furthermore, this technique requires large quantities 
of DNA, generally is limited to only two alleles, and only a few loci have thus far 
been characterized for the canine genome. It appears that with this method, a 
15 separate genetic system must be generated for each breed of dog. and such a 
library may not be sufficiently variable in most situations of interest. 

The randomly amplified DNA fragment length polymorphism (RAPD) 
approach uses random primers to amplify fragments of genomic DNA that vary from 
individual to individual within a species. While the primers are relatively easy to 
20 generate, the method is unreliable with minor experimental changes resulting in the 
resolution of different DNA band patterns. Furthermore, only a few such bands have 
been characterized for the canine genome. 

The candidate gene method is another alternative wherein one or more 
candidate genes is identified based on what is known about the biochemical and 
25 clinical or other phenotypic attributes of the disease or trait and information about 
similar conditions in another species where a gene has been identified for a similar 
trait. This approach was taken in evaluating genes linked to the Wilson's disease 
gene in humans, a disease similar to CT. Unfortunately, the genes linked to the 
Wilson's disease in humans were not linked to CT in dog (Yusbasiyan-Gurkan, V 
30 et al„ Genomics 15:86-90 (1993)). Thus, even under the best-case scenario, the 
candidate gene method is merely a guess and the approach is of course, further 
limited by the availability of identified genes. 

Because canine pedigrees for various genetic disease are abundant, with 
several generations and two or more affected members present in many cases, 
35 these pedigrees lend themselves to linkage studies, provided polymorphic markers 
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are available. Since most of the breeding is controlled, identification of linked 
markers would allow concerned breeders to greatly reduce the incidence of these 
diseases in future generations. 

One type of marker that has been developed consists of simple sequence 

5 length polymorphisms (SSLPs). SSLPs arise from a varying number of repeats of 
a simple sequence, such as a dinucleotide repeat at a given locus, and have been 
reported to be frequent in most eukaryotic genomes (Tautz. D. et al., Nucleic Acids 
Res. 12:4127-4138 (1984)). Such loci, also referred to as microsatellites (Tautz, D., 
EXS: DNA Fingerprinting: State of the Science 1:21-28 (1993)). are best exemplified 

) by those containing the (CA) n motif and are found to be highly polymorphic in many 
species and are being successfully used in the construction of genetic maps of the 
human (Weissenbach, J. etal., Nature 359:794-801 (1992)), mouse (Dietrich, W. et 
al., Genetics 131:423-477 (1992)), rat (Serikawa, T. et al., Genetics 131:701-721 
(1992)) and bovine (Barendse, W. etal.. Nat. Genet. 6:227-235 (1994)) genomes. 
High polymorphic information content and amenability to analysis by polymerase 
chain reaction (PCR) and thus to possible automation, make microsatellites excellent 
linkage and mapping tools. 

CA microsatellites from the canine genome have been identified and their 
polymorphism evaluated on sets of unrelated dogs (Holmes. N.G. et al.. Anim. 
Genet. 24.289-292 (1992)) or mixed bred dogs and beagles (Ostrander. E.A. et al.. 
Genomics. 16:207-213 (1993)). Presently there are about 150 SSLP-type markers 
for the canine genome available. Unfortunately, these known markers lack the 
ability to detect a linked marker for any genetic trait, because of the low probability 
of finding a linked marker sufficiently close to a given genetic locus, to ensure 
detection. Many purebred dog populations have a relatively high level of inbreeding 
which makes it important that such markers be very polymorphic. Further, important 
genetic diseases occur across many dozens of breeds, requiring the markers be 
polymorphic in most, if not all. breeds with many different breeds having varying sets 
of genetic problems. 

It would thus be desirable to provide a method for identifying genetic diseases 
and traits in canines. It would also be desirable to provide a method for identifying 
genetic diseases and traits in canines which has high variability and low breed 
specificity. It would further be desirable to provide a method which allows breeders 
to select and breed for certain favorable characteristics, or conversely, to avoid 
unfavorable diseases and traits. It would further be desirable to provide a method 
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which allows the detection and screening of a recessive genetic disease such as 
copper toxicosis, which is phenotypically undetectable in heterozygote carriers It 
would further be desirable to provide a method for identifying a carrier of a genetic 
d.sease or trait and affected individuals without undergoing test-breeding 
5 expenments. It would also be desirable to provide genetic markers for the canine 
genome. It would further be desirable to provide a marker for the CT gene in 
canines. 

SUMMARY OF THE INVENTION 

A set of microsatellite markers are provided which are useful in identifying 
10 Meed markers for canine genetic diseases and traits. In particular, five hundred and 
mneteen microsatellite DNA markers are provided which are highly variable within 
and across many breeds of dogs. The microsatellite markers are derived from 
regions of genomic DNA which contain a repeated motif e.g.. (CA) n , flanked by 
unique sequences. The number of units contained within the repeat motif is 
variable, such that various different alleles are present in any given population. The 
unique flanking sequences may be used as polymerase chain reaction (PCR) 
primers which allows for the rapid amplification and characterization of each locus 
from a small amount of DNA. Thus, each microsatellite marker has a unique set of 
primers. The microsatellite markers and their progeny are especially useful in 
detecting genetic diseases not phenotypically visible and identifying carriers of 
recessive diseases. In a preferred embodiment, microsatellite markers are provided 
which may be used to detect the canine copper toxicosis gene. 

In addition to identifying canine genetic diseases such as copper toxicosis 
the microsatellite markers may also be used to create a genetic map of the canine 
genome, generate specific breed profiles, settle parentage disputes and identify dogs 
by DNA fingerprinting. Pedigrees of affected individuals, their siblings, parent and 
progeny can also be created. Breeders and owners can thus choose breeding stock 
thereby reducing and possibly eliminating the incidence of specific genetic 
diseases. 

Additional objects, advantages, and features of the present invention will 
become apparent form the following description and claims taken in conjunction with 
the accompanying drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The various advantages of the present invention will become apparent to one 
skilled in the art by reading the following specification and by referencing the 
following drawings in which: 
5 Figure 1A is a bar graph showing the average and standard deviation of 

heterozygosity percentages across loci within a breed; 

Figure 1B is a bar graph showing the average and standard deviation of 
heterozygosity percentages across breeds within a locus; 

Figures 2A-2D are photographs of gels showing marker locus D02011 in 
10 various breeds; and 

Figure 3 is a photograph of a gel showing segregation of alleles at the 
C04107 locus in a Bedlington terrier pedigree. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Five hundred and nineteen microsatellite markers from specific gene loci are 
15 provided which are highly variable within and across many breeds of dogs. The 
microsatellite markers of the present invention comprise a repeat motif e.g., (CA)„, 
found in the canine genomic DNA. flanked by unique sequences. The unique 
sequences (also referred to herein as primer pairs) may be used as PCR primers, 
allowing the rapid amplification and thus detection of the sequence of interest in a 
20 small DNA sample. Table 2A sets forth the microsatellite markers of the present 
invention. The microsatellite markers and their progeny are especially useful in 
detecting genetic diseases not phenotypically visible and identifying carriers of 
recessive diseases. 

In a preferred embodiment, microsatellite markers are provided which may 
25 be used to detect a carrier of the canine copper toxicosis gene. As further set forth 
in Specific Example II below, marker locus C04107 may be used to predict the 
inheritance of alleles at the copper toxicosis locus. C04107 has also been used to 
locate two other marker loci C04107B and C04107C. which either singly, or as a 
group, may also be used to detect the copper toxicosis gene. 
30 The method of the present invention is useful for identifying disease free 

individuals (homozygous normal), carriers (heterozygous) and affected individuals 
(homozygous affected) at any stage of development. While a single marker may fail 
to provide the required information in any particular pedigree, a series of progeny 
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matters will, and thus such a family of progeny markers derived from the linked 
markers set forth herein, are also included in the invention. 

SPECIFIC EXAMPLE I 
Materials and Methods 
5 Isolation and Characterization of Microsatellite Loci. Established protocols 

were used for the cloning and screening procedures as described (Sambrook. J. et 
al.. Molecular Cloning. A Laboratory Manual (2nd ed. Cold Springs Harbor: Cold 
Springs Harbor Laboratory Press) (1992)). Genomic DNA was isolated from a 
peripheral blood sample from a Labrador retriever and partially digested with Bam 
10 HI. Size selected fragments purified from agarose gels using QIAEX beads (Qiagen 
Corp.. Chatsworth, CA) were cloned into the phagemid vector pBS (Stratagene. La 
Jolla, CA) to construct a library of average insert size of 600 bps and propagated in 
the host XL-1 blue. The library was plated at low density (about 500 colonies/plate) 
without amplification. Duplicate nitrocellulose colony lifts were prepared, denatured 
15 and hybridized with (CA)„ oligomer, labeled with M P dCTP using terminal 
transferase. Positive colonies were picked with a sterile pipette tip and lysed in 50 
pi of a solution consisting of 1% Triton X 100. 20 mM Tris and 2 mM EDTA. Using 
primers complementary to the T3 and T7 promoter sequences which flank the 
cloning site, the inserts were amplified from 1-2 pi of the colony Jysate in polymerase 
20 chain reactions for 30 cycles of 94. 55 and 72'C at 1. 2 and 3 min.. respectively 
after an initial denaturation at 94»C for 4 min. The standard buffer, nucleotide and 
primer concentrations were 50 mM Tris-HCI (pH 8.3 at 25°C), 50 mM KCI. 1.5 mM 
MgCI 2 . 200 fM dNTPs and 40 pmoles of each primer in 100 „\ reactions. PCR 
reactions were earned out on either a Perkin-Elmer Cetus (Perkin Elmer. Corp. 

25 Norwalk.CT)oranMJRPTC-100thermocycler(MJResearch.Watertown.MA). To 
carry out secondary screenings of the clones, aliquots of the amplification products 
were run on 1.5% agarose TBE gels (90 mM Tris. pH 8.3. 90 mM boric acid. 2 mM 
EDTA). Southern blot analysis was carried out on the gels after transfer to 
Gene-Screen Plus membranes (NEN. Boston. MA) using the alkaline transfer 

30 protocol. The membranes were probed with (CA) 16 oligomers, 3' end-labeled with 
digoxigenin-dUTP using terminal transferase. A chemiluminescence detection 
system based on Lumi-Phos 530 as a substrate was used to detect positive 
hybridization signals following the recommendations included in a commercial kit. 
Genius (Boehringer Mannheim Corp.. Indianapolis. IN). The membranes were 
35 washed to a final stringency of 0.1 X SSC (1 X SSC = 15 mM sodium chloride. 1.5 
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25 



30 



mM sodium citrate) at 65°C. The blots were than processed for immunological 
detection as described by the manufacturer. Once a clone was confirmed to be 
positive, the corresponding amplification product was then purified using QIAEX 
beads (Qiagen Corp., Chatsworth. CA) after electrophoresis on TAE gels (40 mM 
5 Tris acetate. pH 8.3, 2 mM EDTA) and directly sequenced using cycle sequencing 
(Delta Taq 2.0 Cycle Sequencing Kit, United States Biochemical Corp.. Cleveland. 
OH). The sequencing reactions were carried out according to the manufacturer's 
instructions with the slight modification that T3 and T7 primers labeled at their 5' end 
with M P ATP (NEN, Boston, MA) using T4 polynucleotide kinase were used as 
1 0 sequencing primers. Sequencing products were analyzed by electrophoresis on 6% 
polyacrylamide gels containing 8M urea. The gels were dried and exposed to 
X-OMAT X-ray film (Eastman Kodak, Rochester. NY) for 1-2 days and developed. 
Primers flanking the repeat motif in each insert were selected to minimize hetero- 
and homedimerization; occasionally, the computer program Oligo (National 
15 Biosciences. Plymouth, MN) was used to help in the primer selection. The primers 
were synthesized by the Michigan State University Macromolecular Structure Facility. 

Dog DNA Panel. To check the usefulness of microsatellite markers within 
and across different breeds of dogs, a dog DNA panel was established. The breeds 
to be included in the panel were chosen with consideration given to the diversity in 
origin and function of breeds that exist. Table I presents various characteristics of 
the breeds chosen for the dog panel (Alderton. D„ The Eyewitness Handbook of 
Dogs (New York: Dorling Kindersley) (1993); American Kennel Club, The Complete 
Dog Book (17th ed. New York: Howell Book House) (1985); Clark, R.D., Medical and 
Genetic Aspects of Purebred Dogs (Forum Publications, Fairway, KS (1994), 
Walkowitz, et al.. Successful/ Dog Breeding (2nd ed., New York. Howel Book 
House) (1994); and Lee. M.P.. The Official Book of the Scottish Terrier (Neptune 
City, T.F.H. Publications p. 158) (1994)). Five to ten individual dogs from each 
breed were selected for inclusion in the panel. Pedigrees were investigated to 
ensure that only dogs that had no common ancestors through four generations were 
included for independent representation of alleles. Ten, apparently unrelated, mixed 
bred dogs were also sampled. DNA was isolated from peripheral blood as 
previously described (Sambrook, J et al.. Molecular Cloning. A Laboratory Manual. 
(2nd ed.. Cold Springs Harbor. Cold Springs Harbor Laboratory Press) (1989)). 



20 
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Analysis of MicrosateHite Variability. Amplification of the correct target was 
verified by comparing the product obtained from genomic DNA to that obtained from 
the reference clone. The variability at each locus was tested by amplification of DNA 
from the dog panel. PCR conditions were 35 cycles of 94°C, optimal annealing 
5 temperature (50-60°C) and 72°C at 1, 1, and 2 min., respectively after an initial 
denaturation at 94 °C for 4 min. in the standard PCR buffer conditions described 
above. 100 ng of genomic DNA was used as template in each reaction. 10 //I of the 
PCR products were analyzed by vertical electrophoresis using a modification of a 
SDS-PAGE (sodium dodecyl sulfate-polyacryiamide gel electrophoresis) protocol 

10 (Laemmli, U.K., Nature 227:680-685 (1970)) as described previously (Tas, S., Anal 
Biochem. 188:33-37 (1992)). An HSI SE600 vertical slab gel electrophoresis system 
(Hoeffer Scientific Instruments, San Francisco, CA) connected to a cooling unit was 
used. The gels were poured between 16 x 16 cm. plates using I mm spacers. 1.5% 
acrylamide stacking gels of 2-3 cm were used on top of 12.5% acrylamide 

15 separating gels with 30:0.8 acrylamide to bis-acrylamide ratio. The gels were run 
at 40 mA through the stacking gel and than at 70 mA thorough the separating gel 
until the bromophenol blue dye reached the end of the plates, for approximately 4 
hours. The amplification products were visualized after silver staining with the Silver 
Staining Kit (Bio-Rad Laboratories, Richmond, CA). This procedure resolved 

20 differences greater than or equal to 4 bps in the size of amplification products in the 
75-250 bp range. 

Results 

Screening 1 10 plates resulted in the isolation of 1064 independent clones that 
were confirmed to be positive on secondary screening. Using 600 bps as the 
25 average insert size and 500 as the average colony number per plate, it was 
calculated that 1064 positives reflected an estimated incidence of one CA repeat 
clone every 31 kilobases in the dog genome. 

The first 14 CA repeat ioci for which primers were designed are presented 
in Table 2 together with the optimal annealing temperatures. 
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The complete set of microsatellite markers is set forth in Table 2A below. 
These markers were identified and the primers designed as described above. 
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Table 2A 



Marker 
Locus 




Sra Sequence 


Asn sequence 


PCR 
Product 
(bp.) 


Motif 


COO 103 


CTACTCTTOTATTCCATCAAT 


Al I I iCCCCAl l c I GAcTGGT 


242 


(0TO1 


C001 04 


TGACATAAGCTOTOAOAAGAC 


A '¥ "T"^^ AAA # W W % A T" A A A A A 

ATTGAAAcTGATAGAGAAGAG 


140 


(GT)9 


CO01U 


TACGGAOCCCACACTACTGA 


TCCAAOGOAAOTCATAOAAC 


226 


(G-DII 


COO) 11 


AGCTTCCAGGTCTOGTTTTCCAAO 


TATCCCAGAGCTTAGAGCCTGGCA 


174 


(0T1H 


C00113 


IMIIOAI GGC'l UAA IAAIA 


GAATGGATAAAGAAGATGTG 


12 


(GT>14 


C001M 


CTGC1 1 C 1 CCCTC 1 UCCTATGT 


CTACCACAOCCAATO7TOATT0A 


140 




C00203 


AGGGTGCCT AACT O ACTG AGCC 


1 ! 1CAAAATGGGCI I'lCCITT 


162 


(AOI7 


C00203 


AGGGTGCCTAACTOACTGAGCC 


TTTCAAAATGGGCTTTCCTTT 


162 


(AC)17 


C0G2I5 


JUCCCC1 lAAAUAin 1A1 1 1 


C^TGCATCGAACCTGCTTCT 


127 


(CA)10ACT(AC)12 
(AG>4 


C0O2J7 


ICCTGCATGGAGCCT GCTTCT 


TGTOTATTCAGATGTGCTACTTGGT 


181 


TUA2G(AT4XAT 
3)2(AT2)— (AQ10 
_/OA)16 


C00304 


GCACCACTTGTAACCCTTGAAC 


TCGCATAGGATOATGAATAATA 


181 


(CA)4TA|'CA)12 


C00403 


ATGGAGCCTACTTCTCCCTC 


GACTTGc i u i ATTGu 1 1 ACACT 


123 


fTG)ll 


1 C00412 


ATC AG TCCA J JCiGATTGGCTATC 


GAAAATGGCAGTTGTACCTGAATCT 


209 


(TG)13fTA>4 




A I CACA I CCAAA I CAAG ACTAT 


TO 1 CCT A I GCCTGTC CT ATT AT 


172 


(AQI5 




I UAt Jit ACC 1 1 AC 1 1 CACC II 


AUGGCAACl 1 GGTTAC AGATTA 


109 


(CApT(AC)2C2(C 
A)6 


C00503 


CAGAGCCTTCAGATAACAGTA 


ATTATTCTTTCCC 1 1 1 ICTAC 


230 


(OT79TCTG>4{rA)4 
fTG)7 


C 00506 




AO 1 OCC 1 AAAAC 1 AAC AO AACTG 


173 


(GT>2A(G*79 


C00602 


VrVr/\U\JA/\U 1 1 A 1 UA 1 1 CI AAA 1U1 


OAOCI IOC1 1 C 1 CCC 1 CTOCC 


214 


(AC)7(AG)8 


C00603 


vl J I 1 UC 1 A 1 lul C ACAAATQ 


ACAO ATG AATG AATAC AO 1 lO 


107 


(TG)I2 


C00607 




Tf^A* v TCU i\ 1 II ' H '1 fl'l fill "IV1T A T 

lOCIOOl 1 ICICICI lOlOICIlAI 


169 


(CA)9TA(CA)4 


C00613 


GTCKjACKXTGCTTCTCCCTCTG 


CTTCCAAGTGCAAACACATAGC 


191 


(GT>7(A3T)n 


CO0802 


TACCTGAGTCAGTTTACCTAGCA 


GTTTCTACAGTCAACCAGATG 


185 


<GT>1° 


on/ism 


1 AAOAO 1 J A 1 OCCAC 1 1 G ACC 


CCAGGGAAGAGACCAGTATATGA 


100 


(GT112 


C00901 


T* A A/VlTrrATTPATA/1 A/vtl 
1AAAOVIICCAI iOAlAuAUUA 


lOAICCCAOOAOl ICAl 1CI 1 


105 


(AC)12 


C 00902 


UAUCC 1 OC I J CI CCC 1 CI O 


TGTTTcTT C AATG ACCTTTCAG 


175 


(CA)14 


i COI001 


A1IAJUU IttnAUAA 1 AOCA 


ACCAOAAAC I ICATTGTCTCC 


219 


(GA)12 




ft A A/tTA A ATPl A A A A/* A *Tr A 
VJ/VMJ I AAA 1 VrAACAAAC AA 1 CA 


A A t m tf % A A A A »T A A A /VVrVT/* 

OAAUCAAAAOl ATAAGAOCTJ GTG 


87 


(AC)1 1 




Al 1 CI 1 1CI AIOOC1 AGOCAOl 


TGAG 1 1 1 C 1 CCC 1 C J 1 1 C 1 C 1 


150 


(GT)6A(TG)5A(TG 


CO 12 07 


AGACCACTCTOCTr'^*rTrTT 


Tf^T'f VI HA A AT^AAPA ATP.A 
1 V* AAA 1 U AAV* AA 1 UA 


84 


(GT>15 


C012I2 


AGGTGI 1CICACTCCTCATA 




tit 


(CA)10 


CO 1304 


CTO AGCAAG ACCC ATaCCACTT 


CCTCCrfAfi AAf^AATPTATTTP ' 




rm\TT a rr"c v* 
(IO)/l AflU^4 


C01 303 


GC ATG AG AT AAG AC ACC ACCTGTT 


TTC A I'l'l CCTCiC-CTCCTCtTC* 


IJO 


(oi jy 


C0J4Q3 


GAGGCTGACAACTGTTTGCTA 


GGAG ATAAATG ATG AG AACTCA 


284 


(AT)2T{AT)7CA(G 
A^4_ 4^CAj7(OA/2( 
f*A^ 


CO 1406 


gattttattcatttatccatgac 


CTCCCTCTGCCT ATGTCT CT G 


IV 1 


fpAl|C/|^ A\t£ 

(CA]ID(OA}IO 


C01406 


TGGTG AAAGT AACT AAG AAC A 


tccctctgcctatgtctcto 


150 


/T^A^IA/riAM? 
l^AJIOfOA^I / 


CO 1409 


gttcttccccaatggtattta 


TTGCATAAOAGCCAGCAAACT 


246 


<CA)6A2(CA)3 


C01505 


TCTGCCT ATGTCTCTCCC 1 G 1 ' 


ATAAO ATAPACrt A APT ATTAfiTr 


ivy 


(Grnu 


CO 1601 


CCTGCATGQ AGCCTQ 1 1 1 C I c 


f* ATTTfTfifi A AfS APATAPTtfl/TT A 
V»A 1 1 lb IUUAAUAUA 1 AV> 1 VJSJ 1 A 


145 


(OTJ7 


CO 1606 


atgctgttgattacacagacc 


ATT* ACTTCTTYVST ATTl™ Af* A f* 


1 AO 


(GT)I9 


CO 180 I 


TCTG ATTTTCACCCTT AG AAC 


GCAGTTTTCCTflTPTf'TlTri' 


iji 
i** 


\\ ujiatoi 


CO 1802 


ATGCAAGTTCTAAAACCATACTG 


TAGTGAAGACAGGATTGTGTTG 


137 




CO I 908 


ATCAAGTCCCACATCAGCAOCCT 


AGTGGTATGAGGGGCATAAGGAA 


189 


(OT)I0 


C02005 


OAGTAAAGAAAGAGTTTGAACAAT 


AGTTGGAGAAATGAGCACTTA 


146 


(GD10 


C02L22 


ATGTCAGGCTCCCTGCATGG 


GTTAAATGTAAGATGTC CAGCCTTT 


149 


(CT>4GT(CT)6(GT 
)6fcT)3 


C02401 


CCAGACCCAATOACATCTCC 


ACCCAGGTGCCCTCTTATCC 


236 


«mi8 


CO2509 


TGGCCTAAACACCTCTGACAT 


TGGOATACAAAGTAAATOGAAC 


189 


(CA)18 


C02511 


GACATGATTACCACATTCATC 


GTACAACTGAAGAGACTGACC 


97 


(oni« 


CQ2601 


CTCCCTCTGCCTGTGTCTCT 


TGTrAGTCTTAGCCATTCTOA 


144 


(GT)8<CTP-^(CA) 
12 


C026O4 


CTCACCCAGAGGATGCTTTGAA 


TTAACCTG AG AAC ATGGC ACAA 


190 


(CA)17 


C02608 


AGGOAGCAGGTTTGTGGTTG 


TACTTCTGGTCCAACATTTCC 


110 


fOT)l9 


C02705 


OAGTGATTCTCATTCAAAAAGGGA 


rCAAGGOCACTTTCTACTGTOTA 


116 


fGDIO 


C02709 I 


CTCTGCCTACGTCTCTGCC < 


CACCAGTATOCTGATATAATTCT 


142 


fCA)I8 


C0271 1 | TCTCATTCAAAAAGGCAGATGC 


I'll CAAGGGCAC 1 1 IC1 ACIG 


109 


fGDIO 
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C02712 


GCTTOGATGCTATTGOCTCAA 


CaATG ACTTGGG AAACTACATTC 


156 


(GTO2 


CP7f ft ? 


CCCTGC AT AG AGC CTOCTTCT 


AGC 1 1 1 i GCTTATTATATGCTTG 


186 


(GT)6{CT)2CA(TG 

mcwAnn 


C02805 


G AC AAG AACAGCTATG AGAGC 


TGTTGAGTGTAAGATTCAAAGC 


lis 


(CA)12 


CQ2806 


TCCCTCCTCCTGTGTCTCT 


CTACACCTGTGAAACTACCA 


139 


(GT)1 1GAGXA3T4) 

(CT3)A6(rA2)3(T 

A3) 


C02903 


CCTACATGGAACCTCCTCTTC 


TGTCI 1 1 CCCTC AAC AAGATG 


167 


fTC)4TG<T06 


C029II 


ATCATGGGAGAGOGTGGTAT 


GGGTAGATAAAGACCTGTAAG 


122 


(CA)I6 


cmooi 


TTCAGAGTTAATGATGCTTAGG 


GAGATTCTCTCCCTGTACCAC 


133 


(GT)7(OA)!7 


C03I02 


ACTTGTGTTACCCCTTTTACC 


CCTGCCTTTATGGAGTTTACA 


108 


(CAV5TA/CA)15 


003)04 


TCCCTCTGCCTGTGTCTCTAC 


ATCAATGAAACAAAAGGAACAGTA 


147 


(OT>19 


C03109 


CCT GCATGG AGCCTGCTTCTC 


CACACCAATTAAACAATAGACATT 


183 


(GT)1 6 


C0JJ01 


OCATTCCCAT AG AG AGG AA 


ACCTAGCCAGGACTGGAAAG 


118 


(CA)7TA 
CCAM1 


003303 


TGAGTATTATGACCTGGAGGGT 


TCAGTAGOTTGTGTCTAGCCT 


97 


fGT)UCaG)5 


CO3302 


TCTCAATGATACAAGAACTTCAC 


TCCAGTCACCCTCCAAGATGT 


185 


(AT)1!(TA)8(CA)1 
6 


C03304 


attggcatcattccactggtca 


TGGAGGCAGCTTAAATCTCAACA 


93 


fAC)!6 


C03308 


TG AT AAG AGTGTG AACAG AG AAG A 


CTAGGAGATTGTACAGGTGCT 


273 


(GAV-20 


C03401 


GGTCATCTTTATACCATCAATTAG 


CTTTAATGCTGGCAGATGCTaT 


104 


<CA)10 


C03404 


CAATTCTCTCTATGCCTCTTTGT 


ICriCi IUA1 ICACAGCCAAIC1 


171 


(CTKT(CT>2GT(C 
T)10(CA)18 


C03301 


TCGGAGaTGGAAACTTTTGTAAGAG 


TCT AGTGG ACTGTTC TG AATTTG 


106 


fGTOl 


C03507 


ATCTCGTAATTTCCCATAATACTTA 


ATCAAGTCCCACATCAGACTCC 


161 


(GAP(CA)5TA(CA 
)6(GA)6 


C03508 


TACTCCAATGGCAACAGTTTA 


CCTTAG ACCATCT ACCTC 1 X'J 1C 


no 


(CA)3G(CA)17 


C03509 


CATTCTGCTCATCTCCATAAG 


GGCACaACT AACTC ATTTCTAT 


188 


(CA)15 


C035IO 


CCTGC ATGG AGCCTGCTTCTC 


TGGCT ATTT ATGG AGC ATCTCTT 


156 


(OT)I9 


C03512 


GAGCCTGCTTCTCCCTCTG 


GAGACCATAATTCACAATTCTTC 


113 


<TC)12ATGA2T<A 
3)T3 w .An 


C03601 




TGTl GCTTACCCTTCTGTTAGA 


151 


(CTWGT) 1 0(CT)2 


C0360? 


AO J 1 CCA 1 ALA 1 lAJ I I UwA 


A #1 AAA A AW*T A A ^ * A T 

AOAAAuAUCtT AG ATUCCCAT 


141 


(GT>I8 


C03810 


I IK. i j CI CCCTCTGCCTUI 


GGCTGTAAG ACGCAG A IIILi 


134 


(AC)I7 


C038I4 


ACATTGOO I I CCTUCATGGAG 


GGCAU1 1 1 GGTuATGTCTATC AA 


237 


(TG)I9 


003815 


OiCrCAl UOAGCCTGCTTCT 


AGCTTAGCACCCTGCATGGA 


161 


(CT)6 — (TA3)2(T 

A3 X 1 i A4 XTAJ)4 


C03907 


tagtgctcatgg AGCCTTTC A 


TATCCTGATTCCACCTACCTC 


83 


(GD13 


C03909 


TCAAATCAACTCGTGTTTCTGT 


GGATCTGATAATCCACTTTAGA 


71 


<TG)8 


C039I3 


uAAuGGACAGAGAAAGAAATGAC 


TGTaAGGCCTGTTaCCTCTAATC 


333 


(TOI3<AC)I2 


C04003 


GGviTCTCC 1 1 ATC ACACTG 


AGCaACACTTGACATTaTTT 


133 


(CA)I2 


CD4007 I 


ACCAAATGAGCCACTTAGGT 


CCTCTGCCCTTTCCTCTATG 


109 


(CA)ll 


C04103 


AATGCTGTGG AAGGTG AATG ATA 


ATGGAGCCTGCTTCTCCCTCTG 


224 


fCAWOAM 


C04I0? 


TCAGCAACTATACATTTAAGAGCA 


CTGTCCCATCTAAAGGATAGG 


160 


f(GT)6GA/GT)| 1 


C04107B 


ATCGAGTCCCACATCCTTG 


CATTTACTGGTTTGTCAGTTAGG 


120 


(AG)1I 




TGOOAGATGAAAAGTATCCTC 


CCTGTGCCTCAAG AT AGATO 


250 


(CA)18 




GACTTCC 1 1CI I CCGCATCTAG 


actattcagaaagcagtacaacct 


120 


(GT>6A2fGT>l4 




ATCCTAGTTAGGCATGTGCTT 


ggtaaattacagcaggtoat 


205 


fGA)2(A011 


E2EM 


TGGTTATTACTG AGC AG ACATC 


ccttttgtttccttcaaatac 


168 


(OH21 




AOAACCTATCCAGCTATTATAGTG 


CTCTC AG ATATG ACC AACCT A 


214 


(TO)18 


H'IM'lUI 


ATATACTTTCACTCTCCATGCAA 


AGAAGAGGAGTCTTTGGATG 


HA 


(TO)" 


E2EM 


CAGTTGCTAAGAGGT AGGTC 


GTAAATGATTACCATAATAAGGT 


114 


(CA)13 


C047J6 


TTCTCCCTCTGCCTATOTCT 


AGCACCCTGG TACTGTTTCT 


133 


(CT>3(GT)9(ACTX 
ATC)A(TA3)2(TA8 
XT3A12) 


C04802 


TTACCAAGCTaAGCCTGGCA 


TGGAACCATCACTGAAGGGA 


150 


(C6AXC6TXACJ20 


CO4802 


AGACCACCGAATGGATGGAGT 


TGGAGTAAGTAGCAATCCTCT 


144 


rAC)17 


CO4S03 


CJTTGGTCTCTGGTGGCAATAG 


TGG ACTTOTG ATAC ACCOC ACT 


207 


fCA>17 


C04806 


3CCTCACTCATCATCATTTTC 1 GAACAAGAGATTCATATTTGCTATCA 


ISO 


(TG)18 


C04903 I 


ACTGCAAATAACCTGTAGAGTGCT 1 ACCaATCaCCATTCCCTCATTC 


137 


fAC>!6 


004904 


aAGACTTCACCACTCACAGTCA ( 


rrGGCTCAGTGTOTATGAATG 


143 


f(CA>6T(CA>ll 


C03I0I < 


[TTCTr AACCG ACCTTGACACC 


^GaACTTGCTTATGaAGTCATGT 


108 ( 


rAOI3 


C05102 i 


lAGCTGTGATGTGGCTCTCAAC < 


"AATGGGCAGAAACAATGAGGA 


171 { 


'AO20 


005103 i 


iTTGGCATTTATCTTCATTGT i 


\agaggaaagaatctgtgaact 


196 < 

< 


GT)16T(GT>2A(T 
3)5 


C051I0 I TGGAGCCTGCTTTTCCCTCT > 


ICCCTGAGACCATGAGCTAAG 


183 ( 


CTVfGTO 
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Table 2A (cont) 



WU 


f» AOTTAC AT AT ACCATT AOCC A 
wiu a a nwi 1 Ai awwa ■ i awwwa 


TfirxrrCCTCTTTGTCTCCA 

1 WWW 1 WW Ivll ■ W ■ W * WA 


109 


if AY7T IP ClfC AM 

0 


C07308 


ACATTGCKjCCTAATTTAATAGAT 


GTCCTGGAGAGCTTATAGTAGACA 


127 


/CAM 1 T A/CAM 


007403 


tgccatcttctcatgctcttc 


TCGTGGI I CI 1 C 1 GG AATCTG 


134 


/CAM4/T1AW0 


007407 


TCATTCATCAAGTCCTCAGTTAT 


CTTATGOGCTGOAGGTGTGTA 


121 


fCAH4 

IWrt II J 


C07413 


TTCACAGCAOGOAAACTGTTATG 


ACOCCATCAATCAAGAGAAGTTA 


120 


\w I It V 


007415 




ATTTAATCGACTG AATG 1 ICIC 


101 




C07302 


CATCACGCTCAGACTGTTAGTGTT 

W n • W^WWW I Wrt WrlW • X* ■ • *TIW • X* a * 


■ 9 W 9 9 9 » • 4 r\%J\Jr% 


180 


/fJTYl 1 


C07902 


GACTGAl QQ\ OOAGGOTOAG 


TOTwACCAGTCTGTAAACTAC 


91 




COt 103 


CTTGGAAATGTAATGTU I'Ul A 


CAGTTOTGATA II 1 1 Ol 1 1 1CAG 


91 


ff?A\l? 
iVAH* 


C082O2 


ATGTTCTTAGCCAGTCATAAATC 


TTTOAGOI 1 OGG ATGTTCTCT A 


203 


/fiTtn 


C0S204 


TCATCTAC1 iCCTGTOTAGCC 


GGACATAAGAGGATGTGAGAA 


113 




CO 841 1 


AAGCAGATGCTCAACCACTGT 


G AGG ATCGAGTCCC AGGTC AG 


174 


ICAJIJ 


COM 13 


ACTTAACTAGAGAGOGTGTGACT 


ACCTACTTGCGTGI 1 IIAAGG 


135 




CO 8601 


ATATACTTTCACTCTCCATGCAA 


AGAAGAGGAGTCTTTGGATG 


139 


(GT)18 


C08608 


C AC AG AATACTGG AACTCATTTAG 


AG AA 1 C 1 lATTGGTTCCGI 1 ICG 


155 




C08903 


AACTGACATCAACAGTCTOATAC 


CGACTCTAAG ATCG AG ACCTC 


186 


IvAHO 


C09004 


CTACATGGAGCCTGCTTCTC 

W • AWt f WWnWWW A WW 1 I W ■ W 


TO AAG AGO AATnTJ AATCI ATTC 


138 




C09107 


CCTGCATGGAGCUIIjCI IL11C 


AeAAATAGGTGGTC ACTT AfTG A A 
nww\ ■ nww ■ uu § w^\w • f aw ■ \Jf\*\ 


tin 


Cwl |l4fw 1 )/ 


C09109 


TGfi AfV* A AfpT ATTTTCT AT AAA C 


\#A\ArW 1 WW I 1 W 1 WWW ■ W ■ W 




(III IlO.....(OA>» 


C09205 


CCTCAAATAATGGAAGTGGCT 

WW * Wrtrtrt I rV\m WWrV\W ■ WW 1 


C AATCC AfiTTATG AAATnTTf* AC 

vArt 1 WWr\x« 1 1 r\ 1 VIAAA J w 1 1 WAW 


123 


\\3 111* 


C09210 


GGTGGCTCaGTGGTTTaGCA 

v\J I WWW ■ WrtW ■ WW J J 1 f\WVA 


GrtTGGTTATGATTGTArTTTCTfi 

wv f WW 4 t *\ t V»*» i ■ w I #\W i 1 Ivl W 


149 


1MB 

(wAIIS 


C0921I 


tcacctactgagatacttccat 


CTGCCTATGTGTCTfiCCTTC 

W • WWW t r% t \J 9 W ■ W ■ WWW I i W 


204 




C09213 


tttcacctctgattatatctagg 


TGCATGGAAGCCTGCTTCTC 


140 


Aw |1 B 


C09215 


ccaggaatagacaatgccca 


AACCCTAAOACw 1 1 1GI AATC 


255 


iClAW^ 

WAII A 


C09217 


ctctgcataatgcctoct 


AAGACTATTTATTTATTCATAGAC 


80 


TO) 1 1 


C09220 


CCTACTwTTTTCTGTATTGGCA 


CTwCATAAAGCCTGC I'ICICC 


165 


% • r\i Wrt »o 


C09303 


TCTGTCAATGGATAAGTwCAT 


TCCAGGTTTATTCAAGTAGTTAC 


129 


(CAM 3 


C09304 


CTAGATTCATCCACGTCACTG 


CCATCAACTGATAGGGAAOAT 
ww #» i wv%w t i nuuunnuA • 




(OT)12 


C09303 


TTGCCATCACTGATACAAGT 


TT A 1IICIC11 GCATAAAT AOCT 


181 




C09307 


TTACCCTTGGwTATCTATCTAT 


CIOl lOCATCl 1 1 ICCACCTTA 

w m w • ■ ww^ •wafii ww/^ww A A 


64 


ffiT\*rvflT\ 1 
^w 1 iJwtvi t fli 


C09309 


TOGAGCCAw i 11 C 1 CCCTCTG 


TO II 1 C 1 1 G ATTTGGGTGOTA 


|4| 


\W ■ ^1* 


C09310 


TAG AGG ATC AGGTCCC ACGTC 


GCAGTGCCACGAATGAGTCA 


264 


/rnj l ^nT\t7 


C09312 


AACTGGAAAAATGGATAATCAG 


TTGGAAAGATATTCACATTCAT 


144 


(CA)9 


C093I4 


GTCACTAAATTCACGTTATTGA 


CTTTTCTC AGTGTGTCTC AG AA 


228 




CO9403 


AGATTTOAACCAGGAAATTAGGAA 


CTTGAGAC 1 C 1 CTCl CCTCTUTCC 


182 


(CA)9 


C09407 


TGTTAATCTTCCTAATCTTCCAG 


TCCACTGTTATTGGCATCACAT 


104 


TCAM6 


C094U 


TGGAGCCTGCTTCTCCCTCTG 


G ATCC AC ATCCCTG AGCTG A 


202 


(GT)9 


C09601 


TGGAGCCTGCTTCTCCCTCT 


TGCTTC AAAGG AC AC ATC AAGGT 


138 


\wl |I / 


C09607 


GCTGGTTCTTTCTCTATTTATAC 


ttcaaagctagtcactattagca 


131 


/ri\ii 

\vA 11 J 


C09609 


ACT GCTGGTTCTTT CTCT ATTT 


gotaaatacttgaggaattaapatt 




//** A\f ^ 


CO 96 10 


CTAGCTTGCTCCACTGAwTTCC 


CAG ATGCCTCCCT AA AH ATflTfi 
wn wn i www ■ www 1 aaawA i 1 %V 


163 




C09703 


gcttcaogaatctaggoacaa 


tgtatttcctatgcaatatacc 


152 




C09805 




cacagcaagtgagagtgagca 


156 


(GTJ10 


O09S06 


GTAGTCTGC 1 1 LI CCCTCTCC 


ttctcatatgtggtaactgagta 


208 


/CAM* 


009807 


GC^AAATTAACCTATATTTAGAAC 


aaggcctcagacatgaactataat 


176 


(Ul |DAI(U I Jj 


CO9903 


TCCACATCCI C'l'l A rCTGTTO 


aactcagtgogaccttcaata 


148 


\w t |J A 1 |Vt 1 Jl 1 


C09912 


AAOATGATAGCTTGGTCAAAGAG 


gaaccaggtaattcttctattgaa 


135 


f C A\ft A A ff* A 1 1 A 


C10103 


01 1 wGwCTCCCT ACTCAGTG 


gagtgtggagactgcttaata 


289 


(CA)1 1 


C10104 


GGCAGATTTCTCAATACAGATTA 


tgctctcataatagacgaatcacc 


119 




D0010I 


ACTCTTCTCCATCl CCC1 CI GC 


tcgttggggttaaagctctgacc 


130 


(CA)9 


D001Q3 


GTACTTCCTCAGCTTTCCAATG 


ctccctctgccttgtctctg 


177 


(AT3>4— (GA>4<C 
A)12 


O00I09 


TGTATGCTCAAGGATTATCTGG 


tctctgtgcctgtgtctctggc 


127 


fCA>17 


D0O401 


TGCCCTCACCAGGTGTATAOA 


stgtgaatatgatgtgtctagaaa * 


W 


fCA»2 


D00701 


CCTGCATGGAGCCI ICIi JC 


rGTATwCTCATTAACCATAGTCTT 


150 


rGT)17 


000704 


ATGGGGGAAAGCTGAAGGAGATCC 


rGTCAGACTGATAATAATGC 4 


159 


rCA)23 


D01004 


TCCCTGCATGGAGCCi OCT/ 


oaacccagattccagttgcta : 


146 


[TC)12+fGT)12 


D01304 


TATCCTACCTCTACACTCCrrCCTG 


rGAGAGTTAAGGGGGTTAATGG t 


>89 


GT)20 


DO I 205 . 


AOCATOATGCCCTTCAAGGTC ( 


3gatctttacxxxk:atgttcc : 


E01 


GT)2A2fGT)16 


IX) 1208 , 


ACTCTG ACAAGO 1 1CIOGCG i 


SAGTTTATTTTGGTGwTGTC l 


30 


CAM2 


D012I0 1 


SCCACAACTACACAAATAACTAA 


rTCTACAGTGATOAATOCGAGT 2 


[13 


CA>>10 


D01211 4 


CHJ1 1 1 JU 1 4 CCCTTTAGTGA ( 


3TTCATAGCAGCAATGTCCAC 1 


27 


CA)23 


D01212 < 


ZATAATAATTCCCACCACTACT < 


XJAGCCTGCrrCTCTCTCTG j 


33 f 


CAM 7 


D01214 j 


^TCATTGTAAAGCAACCTCTC | TTCTCCCTCTCCCTCTGCCT 2 


34 ( 


CAU'GAK 
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Table 2A (conLJ 





D012I5 


CCTGCATCG AGCCTGCTTCTC 


ACOAGAOACTCCTAACTCTGOAA 


260 irron 


001504 


CTGCTTOTAU 1 V I AAOTAOOTC 


CTOACTOOOCACAOTOATCTA 


ij« 


(TC>5(CAXTGXT 
AXTO)9 




1 CCAAOOGOTATOTTOTCTATTACT 


CAOCATGAAGGATCTCTOACTA 


157 


focwAon 




1 CT CCCTCTOCCTA 1 XJ TCTCTOC 


TOCAACCAGAATATCAGTTCCC 


430 


fcni6foni9 




1 CTG ATACTCAOTTOCACTCOOC 


CTOOTOACAGAOOCTCAOATCC 


396 


(AQlOAOfACtt 


u:nam 


1 0TAG AAAGCACTO AAOACATO 


ATTIO01 CACAAO ATAGAOOC 


279 


rcmn 




iVOmS | TTACTOAAOTOATACTOTAOCCTOC 


TAACrZTCTCTTGQATGTOAAGO 




(OQ9(AC>5AT(AC 
V 




I'MK'Ji 


1 TTOGGTO ATAATATCTATTOCT 




190 


(CA)13 




ICM902 1 CCTACT AAAAT ACAQAAACO 


AACTCTTAOAACTTAGACATOC 


170 


(ODU 


IDOIOOI 1 GTTCTCATAOAAO0AAOTA0O AOC 


ATA I 1CICI i AGGTTAG ACAGCAGO 


ill 


(ACTIO 






| CI JCIOCATCATCAM1 1AC 


GTAOATATTGAAOAATOAAACA 


tmi 
1*4 


(CA)!7 




1 TCT AAATATOTATATOT ATOCOT 


CACTTTATAACAACATATTCAAAT 


no 

117 


(CA)13 




1 TAAAO 1 1 ICCICA'I 1 M CAOT 


ATCCriClUCTI HTOCCTAATA 




in nil i wwwm 


K'Wl 


| CTG AG ATOTOTCAAAAO IVC 1 1 ICO 


TTOCCTACAAGATCCCTACATOCC 




(GTI15 




1 TTAAGCAO AACCTCCOCTOC 


AATTTTOGTOCCCACTATGGAAGOC 








j OCTCACCACATGA 1 CI 1 i GTATTCC 


TTCTCCTCTGCCTGTATCTCTGCC 


ISO 


/1PIIA 


D02210 


ucxjiciGAArnToncAC 


ACATCAGGCTCCCTTCATGG 




(AQIl(Al>2(AQ5 


D032II 


CCAGCATTACCCTGATAOCA 


GAATAAATCCTCCTGATTGTG 






002212 


AGCCTGCTTCTCCCTCTG 1 CCTT AGTATCCCAGT ATCAC 


213 


/on 19 


D022I4 


AAGATTCTGTOAGACAGGATCAOCG 


ACTOOAGGGAAAGATAGCCAATGCC 


1 Al 

19} 


fTflWA 
f 1 VJIO 


002919 


GOTGCAGTTACTTAAAOACAG 


ATGTOTTOAACACATAOT AOO 






D03202 


CTGTCAGGTCACTOAGATTTAGA 


CCAOOACTATACCCTCCACAT 


1 JO 




D03209 


ACTOOAOTOAAAOGTTCAOGA 


CTOCATGG AGCCTGC 1 IC1 


300 




D03301 


CCACCACACTCCAGOTTCCA 


CACTOTAAAGTAGTTOAACrrAC 


23! 


fCA)17 


D03503 


GGCTCCTCCTTGOCAOAOA 


CrOGACTTTGCATTCACTTTTCAG 


133 




D03601 


GGAATCTGCTTCTOCCTCT 


ACATGTOAGATGCTCAATC 


115 


(GT)20ArrO)10 


003707 


AGAGCCTAOATOCCCATCAA 


TTCACTTAOCGTAATATCCTCT 


156 


fGni9 


D03708 


TTOAAAOAGATAAGGAGTCTOOAG 


TGCAGOTCCG ACTCTAO AOO AT 


•* 


fGT\3A/GT\5 


D03709 


ACAI 1 1 C 1 G AGTGGCATOGCT 


ACTCOCAAATCTTCACAAAGGAA 


BO 


(OT)9 


D03S05 


GTCAACAGCTTAGAAGTCACCA 


ACTATTATOCTGTATGGGTGCAA 


90 


(AC)12AAT(AC)3 
A/AC)2 


EZEUM 


CTAAGATCAAATCOCACGTC 


GATTCGATCTGAGTTAGCAC 


Hi 


(TGWTGW 




ccacccaggcatcccaaoa 


ATCTCAGAGAGTTGGAATCAATC 


190 


(AO!9 




D03I2J | ATCTGCCTOCCTGCATOAAO 


ACrrOTTTTCCCTCATATCTOTT 


141 


(CT)10CTG)3 fT 

A3yTA4XTA3)9 






tacacctgacacttgtatcc 


GTGCTTOTTAGTCCATGACC 


94 


(Aqn 


ID0410I 1 CTGCATGC AGCCTGCTTCTC 


OAATATGATOTACCAGGTGTGG 


171 


<TO)l6 




WAVAm 


CCCAGGCACCCCCTnTCIC 


ATCAAGTCCCATOTCAGOCT 


179 


(CA118 


WjIVAM 


CTATTGATTTTTOCAAAGC 


GTCTTTCATG II 1 ICATATACTC 


130 


((GT7I5 




ACTAGAAOACAOCAAAATOA 


AGGAATCTOCTTGGATCTCT 


176 


(AOH-(OTU 


!»:■!»■>■ 


OAACCfGII 1CIXXCTCTOCCI 


GTCTCTCCCrJIUCCTCOTAO 


138 


(TG)I7 




004304 j GCAATCTATTAOTGOGGTCAT 


CTOACTCACAOCCTOAAATOTAT 


224 


(TG)14(GA)3GC<G 
A* 


1 




TTGTCATTGAGOAGAGTCAT 


CCACTCCAOAATGTATCTAAAC 


96 


fCA)5TA/GA)S 


004517 


TTGACTAAGGOACTCTCAG 


TGOOTOGCTCAGCAGTTTA 


254 


(OA)3<CA)I0(GA) 
14 




004606 


ClbCl IC1GIC1CIGCCTAAT 


TCCCTCTGCCTOTG1TCTCTO 


280 


(CT)10_fCA)13 




D04609 


AGCTATCTCTTCATTTOATCTATCC 


CTAGAAGOACAAGTGTGTCTACTGC 


223 


<to>ioag<tou 




004610 


atccaaagacaattcaaaoo 


TTGGOTCTATTTCTGGGTTCT 


133 \ 


CGDIO 




D04613 


atctcactcaoaocoaaagct 


COAGTTOCAAATCTTACAGG 


293 


fGDIOMTWACtf 




D04614 


ATCAAGTCCCACATCOOOCT 


(JIUUZ ILTIAICLTITCTCTTATC 


154 


fCT)12(GT)12 




004616 


TCTCATTCriGl 1 1ATOOCTOT 


ATOCACCCTTATGTTTATTGCAO 


167 


conn 




004617 


AGOATGAGGTAGGAGTCAOAA 


GCTATGCTTTOGGATOAOGTO 


271 


(oni4 




004702 


GTCTTCCAAOTGOTAAOAGCCTACC 


ATCCTCCTCTACCCTCAOAOCC 


112 


(fCA)12 




D04710 


TCCCTGCATGOAOCCTTCTT 


CATTCATTCTAACTTGAOTOTC 


26 


(GT>17 




ooaio 


CTCCTCTOCCTCTOCCTOT 


ATOAACI CTOCACTTOOCGT 


231 


(TO)14 




D0481I 


TCAAGTCCACATCAGGC7TC 


ACGTGGTGGTATCAAOTCTCT 


lt9 


rCA)19 




D04812 


TCCCTGCATGGAAOCTGCTTC 


ACTCGOTTTAOTTOOAC IUCJi A 


190 


(TO)ll — (Ami2 




O04IU 


TGCAGTrCAOTAAAGCAOOCTA 


TOAOTOACIOIOT JCIA1CI IV I 


122 


CTO)I0TAGTCCTO 
*TCTAfTGT7 




004907 I 


TCGATTGAGCCTOCCAAATAACr ( 


CX^TCACOCGGAGTCTGTAAT 


116 


fCA)13 




D049U 


rGATAGACACTTGGG TI OCI TCCA 


ACTCTTOOGCATITACTOCAAGOA 


164 


[AJ>5(G1XAT)5C 
rAT>7 
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Table 2A (conL) 



|i.'«M»M:M 



ACATCGGOCTCCCTOCAT 
TCCCTAT ATGO AGOC T O CT 1 CI 



GAAACTTCATAGGCAGACAAATG 
AOOCATCAOOAAATATTGTOOOA 



DX-4 



ACTCTOCTOTATAOACATCrroT 
ACATCAOGCTCOCTACATOO 



ACCGTOCATGTCGCCACA 



1 232 



O AAGCTCCTATTTGCCT7TC ACC A 



1 200 



AAAGTACCTATGGTTGGAOCATA 



136 



AGAAAACACACCCAOAOACAGO 



1165 



AOCAOAGGACTATOOOAAATAAC 



108 



CTCACTCAGGTTACTTOGCTGC 



1170 



(CA)17 



(OT)16(OAttl 



fTG)13 



fCTWOD? 



E00402 



TCACCU1 J I lAOCCAGTATTCC 



TCCATTOCTG ATCO A G i IViO 



1212 



<ATT05(AC)3(AG 
ill 



E00409 



TOCI 1 1 J GGATOOAGCTGAAO 



TGAOAGGATCAGT7TCTOTTO 



1211 



£03906 



AOCAGTOAAGTTTAATOAAATAC 



OCTCAGGAATTACCAOAOQAO 



£03909 
£03912 



AGCACTTACAGGOTGTOGTCOTA 



OACTTCCCAGTTGACTAAATAAGCTA 1214 



1S2L 



TGTOOAGTCAOCTTCAOATTC 



GCTAAACCACTGCACCACTOO 



[130 



£03913 



AAACAAGTOGGQAGGGOAOO 



CTTOATCOAQCCCTGCATTOO 



L1I7 



fAGHC(GT>7 



E03914 



TCAOTCCCACATGCAG C1 IUO 



CTOAOAOCAAATTGTTATTOTAA 



1202 



(CT)\6(OT)8 



B03917 



AGGOAOAACAGATACTOACTCAA 



TAATCAGCCTCTAAGOATTCTGG 



(216 



£03920 



CTCTGTOAAGCCrG Cl ICIC 



AGCCAGTCATGTGCCCTTA 



132 



CT>9GaC)3 



E03922 



£03923 



CACATTOACATAAAAATAATATOCCA 



CTOC ATGQ AGCCTG C I IC1 IC C 



CAGTGCATGGAGCCTGCTTCTC 



E04001 



TCAGCATGOAATCTACTTOAO 



GT7TCAOCATCTGCACCAGOAT 



[192 



(AG)17 



OAATOTOAOTACAAAGGTAGG 



ll llll II I 1 — 



1 76 



(cnn 



£04007 



OCTCATTOTOATTCCTTAAAACAG 



CTGGGGTCCGGGATGOAOT 



1202 



(GA)3AfAG>13 



E04QQS 



GOTAGCCT LfCl IL ' IC CCTCTO 



aocagtgattcccttcacctg 



143 



fCT)12fOT>3 



E04019 



OCCCTCACTGGAC ArCTI i ATT 



TGGAGCCTGCTTCTCCCTCTG 



116 



<OA)13 



E04021 



CAG7TT GOAGTC1 PC 1 1CICCCT 



ATCACCTGAATTGCAGTTOTCA 



[182 



(CT)10 



£04104 



ACTAGGCATCTCACATACATTATT 



CCTGCTTCTCCCTCTGCCTAT 



1 109 



(AO>» 



E04103 



CCTGOAATGOAGCACCATOTC 



ATACTTATGTCCCTGGCTCTG 



1168 



(CT)8C2T2fCn6 



KM 107 



CTCCCTCTGCCTATGTCTCTO 



CCAAGCAGTTTTACCACOATA 



1 110 



E04108 



CTTCTCCCTCTOCCACTTC 



TCTTTATTTOACAOGOAAA 



198 



(CQI0 (CDS 



£04401 



CCTGOCATOOAOCCTOCTT 



OTTT7TAGGTCTACACTTCTOAGT 



1122 



(CT>9(OTO 



E04402 



TGAATCATTATGGTCCTATCGTTC 



TAAAATOCAAQTCTTACCAOAGGAA Mil 



0013 



E04403 



TOCATGOAGCC1 G CI ICI C 



nTTCATTOAATATCTGTCAT 



1123 



GCCACATAOACACTTGQTGTT 



CGOGATGGAGCCTGCTTCTC 



1114 



(0^12 



E04408 



GOAGCCTGCTTCTTCCTCTO 



TGCTTCTGGAAACTGCACAT 



CACTAGTAGCTTTATAATTGTGCT 



124 



TGCATGGAGCCTGCTTCTC 



[144 



(CT>14GrrC>4 



(AG)U 



AOCcrGCTTCTCCCTCCTC 



rTTTTAGTCT AC ACTTCTG AOTAA till 



fCT79fTG)3 



E044II 



CAGATCCAATCCCACATCAC 



TACTCTTCCACCA7TTTGCC 



166 



(CD1I 



TCTOCCTATOTCTCTQCCT 



TGTATGTCTATTTTTGTGCCAOTA 



164 



G00402 



000410 



OTTTGAACCCCTOCCATAGGTA 



TGGAGCCTGCTTCTCCCTCTG 



CGQAATCGAGTCCCACOTCA 



GCC AACTCTTTAC ATCT OTGCT A 



1 175 



|148 



(CAWCAUO 



1S32U 



G00301 



ATOCCCACCTCAGGTTCTCTG 



GTTG7TCCAGTATTCATTCATTC 



1171 



S£D1L 



000304 



000508 



CCTGCTCAGCAGAGAGTCTG 



ACTGCGTGGAGCCTGCTTCT 



GATTGGATTATTTGTTCTTOG 



GATGTACTGGCCCATCATTCT 



161 



[196 



(CPU 



(CT)14 



G00602 



CAGOGCTCAATGACTGATGTTA 



CGAGCTOCTCAACCOCTCAAC 



tcaagtcttgcatcgcacacc" 



TGOAGCCTGCTTCTCCCTCTO 



(138 



|187 



XCA213_ 



(OA)19 



TCICCCTTTGCCTGTGTCT 



GTCTATG AG AGC ACC AGGTTCA 



|I90 



(CT)11 



CTTCTCCCTCTGCCTGTGTCT 



AAGTTGTGTATTGATTTCATTCTG |206~ 



G007O4 



GGTCCTCTGAATCCCTGTCTAT 



GTGGAGCCTGCTTCTCTCTTTG 



1223 



<TC)6T3(TC)7CA( 
CTflCgCE 



(CT)9T(TC)3_An 
17 



CTTCTCCCTCTGCCTATGTCTCTG 



GAAGGCTTAGCAAGAGTTGAAGA 



189 



O007X2 



CCTCTCCCTCTGCCTGTGTCT 
ATCGAGTCCGACATAGGGTTCACT 



ACCTCTGAATCAGGAAATGTAACT 



132 



(CI)13GACTATC 
A(TA3)2(TA5XT2 
A10VTA6)2 



ATCGACTCCCATGTTGGGCTCC 



AAACAGTGTAAACAACATGCTACC ITST 



nan. 



TGAGCAGGGGCAATAGGAGACTTC JST 



(CD12(GTWCT>3| 



(CT)9G(TC)3ATG( | 
A2T2XA3T)2(A4T)| 
(C2T2A8) 



CTGGATGGAGCCTGCTTCTC 



GCGTATCTAGTGATGCCACTTCT Jw" 



(CT)10T(TC)3ATG | 
(A27XA3TWA4T) 
CT4A4) 



TGCTTATGCGTACTCICTCTCAA 



TCCCTGCATGGAGCCTGCTTC 



1184 



<CT)J2GAO<TCp 
ATO(A27XA37V( I 
A4TVCT3A9) 



CTCCCTCTGCCTACGTCTCTG 



AGAAGTTACTGTGTCCAAGTACAA TTJT 



(CT)17(GT>4ACTAj 
TCAT(A3T)2(A4T 
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Table 2A (cont) 



ocmi2 



CTGCTTCTCCCTCTGCCTCTATC 



£01006 



TOCTTCTCCCT Cf 1 1 CI GTCT 
CCTGOTGCATOOAGCCTOCTT 



aogaactgocattctacattacca Tm" 



ATTGTGAAAATCCCTCCTTAGAAAT 1 142 



OATGCTGCAATGAACACGAGAQCT 



TTTATTCTCCCTGTGTTCTT 



(CT)1I(GT)3CTCA 
TO(A2TXA3T)2(A 
*T)C<T3A6) 



(TC)1 fTATCA(A3 
TfflArniAlO 



AGCCCAAGTTATAGACAATGAT 



(CT)ltC2T2(A5G) 



aoccatttgtatgtcttctttcta u5T 



(TC)17(TG>3TCTC 
3ATG(A27XA3T)2 
(A4T) 



GTGCTAGATGGGGGCTTCCTC 



CTC)17GTG<ArrX 



TGCrrCTCCCTCTGCCTGTOTC 



(TC)16 (JA3)2( 

T2A5XTA2XT2A! 



ATTTCOAGTGTTTCTTCGAGAAGTT 



(CT)13 — (A2TXA 



QQ1309 CTAATGTAACATTGTGTGACAACTACA 



gaaatccacattatatgaggttaaac 1i5T 
catggagcctgctactccctct |tto" 



CCTTGCTCACCATATCACACA 
TGCTTCTCCCTCTGCCTATGTCTT 



TGGOATGGAOCCACAAGTCA 
CCACTCCCATCTCTGCTCAT 



TCTTTCTCTGCCTCTGTCCT 
GTGCAOOGCTCAATGAGTGATGTT 



aC)I6fGTO 



CTTACGACTGTTTTCTCAACCTG 
CCAACOACTOAAOCTGTCAT 



fCDlO 

(crKCA/cn6 



TCATTGCCAGACCAGGTGTC 



QGGcrrrGccTcrrccTCCTTT 



GTGCATGGAGCCCGCTTCTC 



O0I713 
G01713 
O017I7 
001804 
001807 



E3HM 



AGGGAAGACCCOTGACCAT 
ACTAGAACTACAGATCAOTCC 
ATGGAGC CJbC I I CIC CCT 
TGOAGCCTGC 1 ICJC CCIC l 
CCAAGG ATC AAG AACCACGT C 
AGCATCGAGTCCCACATTGG 



OCTTCTCCCTCTOCCTOTOTC 
G AG AAC AATOGC AGTT OTCT 
GOGGTTGCCTCTTCCTCCT 
CTGCATTTCCCTGA7GAGAT 
GATGCACTCTCCAGTTGAACTA 
TCAG TTAG AGC ATGAATCTTGTC 



O0190X jCTCCCTGCATGGAGCCTACTf 



I 



IG0J906 
QQ192Q 



TATGAG7TGGGCTCCTGCTC 
AGTCCTOTGTC AGGCT CC AG 



CTGGGACAGTAACACACATTAGT 



(CT)2GC(CT)12TT 
rCTWGTM 



I AATTAGCAGGGAGTCTGTTTC 



J TOAACCCTGCTTCTCCCACT G 
IGAGTCTGCI 11 JUCC1C1U 



t 



j TGTCTCATTCTAGCTGCT ACATT 
J TGGAACATATCTTTTTGGGTOACC 



} AGQATCATTGOCTAG ACAAAC 
J TCCCTGC ATAGGOCCTOCTT 



{ CATGGAGCCTGCTTCTCCCTCT 
I CTGCCCAQAGAOAGTCCTOCAT 



I CATOOAOCCTOCTTCTCCCTCTG 
I ATTOO AACTATGCAGOCTat" 
| GOATCOAGTCCTGCATCOAG 

I atcaggctcatcccgcatcag"' 



lOAOCCTGCTTCTGCrrCTGCC 
j TAOAOOATCGGGT0COOCC7C 
OCAOAAACATACACTCAOTAOG 



ATAGTGCATTCTTTTCAAGGAC 



ctagagttctctcaaatctgtca 



GOT ACTTGGG TTTT AG AAT AT 



ACGACTTGAGCCACCCAGGTA 
CTGTACACTCTAAATGGOGTCATT 



CTTCTCCCTCTGCCTGTGTC 



CCTGCrrCTCTCTCTCTGCCroTC" 



TACATAGTTGOOATCGAGTCC 



OAATAAAACCTAGACTGGCTOAAG 



(TA)6 

(TC)U(GT)2 



{CTHT2(CT)5 



(CQ9 (Of\2 f CTp 
(CT)9(A3T)2CT2A 



(GA)U 

<03J23_(CA)6(0 



AAOGCAGATGCTCAACCACTGA 



TCGAATOCCATOTCOOGCTt 



AO AATATCTTGGCT GCAATOCT1 



CCTGTGTCTCTACCTCTCTOT 
CTO AOCC AAAGGCACTC AACAC 
ACATAAGGAAC7TCTCCATOCAT 



GCCTATGGTCTTATDGOTGTTCC 
TTCACATGGTCTTCCTTTTOQfrr 
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Table 2A (eont) 



rt?.W:T.V 



1002813 



ACCCAGGTGTCCTTCAAAATCT 



AT0OAGCCT0CJ ICTCCC1U1 



GOC ACGTTAGTCTAo 1 1 n c : I G 



CCAAATTCAGOATTTCTGACTCC 



GCAGCCAATATGACATCATCC 



TGCATGG AOCCTOC 1 iCi COC 



TAGCTGTOAOCTOOGTCrroOA 
CGAGGATCGAATCCCACOIC 



nrxrmm 



GQ2902 



TGCTGCTTTATAGTAAAAATO 



TCCTOCTGAATATCACGTTCA 



ATCGAATCCCACATCAGOCTC 



ACACTCAGCATAGAGTCTOCrrO 



OCTGCACAOAGCCTGCTTCTC 



GATTGAGTCCCACATCAGGCT 



GCTCTCCCTCTGCCTGTGTCT 



TCAGCTATAAATTCAACTGGCTTA 



TAATCAGg i I CI I GGAOATQAC 



ATGG AOCCTG C MCI C CCTCT 



TACATOQ AOOCTGC 1 1 CI C L C 



OAACAOCTTTTGCAGCAOCC 



OOCACTTCACTT AATCTTTO AGT 



TCATTTGTCACTTATTAOTCCAC 



CCTGCTTCTCCCTCTGOCTAT 



AAGGGAGGGOAAACGACACAT 



CACAAATDTAAACTGOOTATATT 



CACCAGGTTOOAAATOAATAAG 



AAAOCACTGAGCCACCCGOACT 



AGCTGTGTTTATGACTACACATG 



206 



151 



139 



161 



161 



173 



114 



174 



265 



154 



177 



154 



153 



241 



(GA^9 



(CT)14(GTyCT>2 



(OA)8AT(GA) 



fOA>12 



(OA>8 



IEQ1L 



(CT>7 



<CT)2GC(CT)9(GT 



(CA)5(GA)12 



(CT>15 



(CT)9 



(CT)14 



(CT2TG(CT)6 ( 

CT>S 



C02903 



TAGAGCCTGCTi C 1'CCCTCTG 



CCAATTTGAAGGATTCATCATT 



146 



<CI)2GT(CT)7GT 
ATfCTfS 



G0300I 



TCCATCTOCCTATCACACCACT 



TGAGCACTGOATGTTATATGCAA 



199 



q-c>9 



G03006 



ATCTAATCCCACATTGGCCTC 



ATOGGGAGTCATCAGACCAGO 



171 



G030U 



TAGGCTTCCroCTCAAOACAO 



OOATOOAGGAGAGGCTTOTTA 



209 



CGAT)6„frC)9 



G03012 



CTGCICIC1 1 1 ICGCTCACTC 



MCI CCCTCTGCCTGTG1 CT 



141 



(OA)17 



G03013 



ACTGAGATGGGAAGGGGCAGA 



CTACATCGGGCTCTATOCTC 



S3 



12A2L 



G03016 



O AGCCTGC 1 1 C 1 CCCTCTGC 



AGTCCTGTGATTAGTTCTCAGAC 



106 



(cnio 



G030I7 



TCCTCCCAACATTCTACAATOAA 



134 



(OAUCA(OA)9 



GO30I8 



TGCTTCTCCCTCTGCCTGTGT 



CCTTCTGGATCTGC I ' 1 11 ACT AT 



203 



(CT)13 



GQ3019 



OCACTCAGATCTCCCTATACTAT 



AAACAGG ATCG AGTCCCACA 



212 



(OA)13 



G03104 



TAGCAGACAAACCCCAACTG 



GAGCCTGcmciCCciuiG 



167 



(OA)13 



G03I09 



CTGCATGG AGCCTGC I 'I C 1 1 



TCTTATTCAAATCCTCCTGA7TAT 



153 



(CT)9 



G03II1 



CCTGCATGGAOACTOCTTCT 



T GI nC CTCA CI ICI I ACTOA 



218 



S££BL 



G0360I 



GACACCAGGTTGATTATCATT 



TGOAGACCTGGGATTGAGTC 



166 



(OA) 10 



G03901 



ATCACACCCTGGGCTGAAOO 



TGOAGCCTGCTTCTCCCTCTG 



174 



(GA)14 



GO480I 



AGCATGCCCAGTTACATTTOAA 



TG ATG I'll G ATGTTCACGTTG AT 



208 



(OA) 18 



G05002 



CACTGTGTATGTCCTCTTATTAAO 



C AGG AGTCT ACTTTTCC M CI G 



170 



(OA130 



G05602 



CACTAAACCACTOAACAACCT 



GTCCCACGTCAGGCTCTCTG 



138 



(OA)9 



G03602 



CACTAAACCACTGAACCACCT 



GTCCCACGTCAGGCTCTCTG 



158 



(OA)9 



G05604 



TGCATGGGGCCTG CT rCTC 



CCTCTTCATACTTCAGCAAGTG 



169 



(OT)9 



G06202 



CCCTTCTCTGTCTTTGAGAGT 



AGCCTGCTTCTCCCTCTGCC 



144 



I c 1 1 1 1 CCCTCTG ACTOTOTCT 



<GA3)C(AG)9G(G 
A)5 



TCCCTCAAAATTCAACATACAA 



168 



I CCTOCTTCTCCCTCCTCCTG 



TCCACAAAGCTCCCTACTCAT 



163 



(CTH KG Att(CTV2 



I C ACTGGGCOTGTAACCTGCT 



CTG AAATGT AAGTGC AAAGO AA 



172 



(cpio 



I CT AAT ATC AAAAGGTTATCCAC 



CATC I ICCTCTGCCAGTOTC 



IG06221 I 



267 



<CT)12..(A3C)8 



G06221 I GG ATAACCAGG ATAATTTCCTAC 



AGAGAGGCCCACATCAGOCT 



156 



(AT4XAT3P(GA)1 



Iciuci iCTCCCTCTGCCICl 



AATTTATGGAAATGTTCCCAA 



150 



lOAGC ClG Cr iC lCCCTCTOCC 



ACCCATOTATGAGCCCATTQA 



137 



(CT)17TfTC)3 



\ CAGGTGCTGCAAG AGC7TAGA 



CTTCTCCci i i cOCTCTOCC 



176 



CT)19 



( GTCACOTCTT CAACCCTTCCT 



ATTGAGTCCCCCATCAGGCTT 



215 



(QA)17 



( AGCCTGCTTCTCCCTCCTC 



CCACACCTCACACCGTOTA 



125 



(OA)U 



I ACTGGCAATOGGTCTO AAAATAG 



CTCAOTT AH m iGOO C ' IU 1 1 



216 



(CT)13 



rCTGTATCTC 



I ATGAATAOCTTGTGCATCAGTGCATT 



I CCATCAAACTTTTACAOTOAA 



J CAATCAAACTTTT ACAOTO AA 
f OTT AGGCTCTCTG 1 ' 1 CAQTOQ~ 



I TGTGOAAACTGCTTACAATTTTC 



I TCGAATCCCACGTCGGOCT 



1 CATTC AO ATGCGGG AOTTTC 



ICTTCACAAOGTTOCACAAQAG 



\ TCATAGG ACACTTAOCAAAOOCT 



t ACAACCTACAGAATOOGAQAA 



I GCTTTTCACCCAACOACTTAOA 



| CTTCTCCCTCTGCCTCTGTC 



| CTCTGC CTGTGTCTCTGCCTC 



CAGOTCCCCCTACACTAAGTO 



133 



<OA)13 



TGc i I c I CCCTCTGCC 1 01 U I 



132 



(OA)10 



GGGTCIOCl ICTCCCTCTCT 



163 



OA)13 



(GA)12 



COOTOATACC I ICCTCATCAT 



146 



(CT)9 



TOCCTTAOCTTACAAAOTTATTG 



158 



(CD17 



ATOTTACAATOATCTOATTTATTCT 



236 



CCAOGTOAGO'I OCAo I imC 



211 



(CT>5GT(TC)I2 



159 



fOA>9 



(OA)l4 



194 



Ci iccACAGCii 1 1 TAum 



196 



(CAPfOA)12 



AACTCTGTGGCTCAGCAAGG 



211 



fcnio 



GCGTCTATAATCATCAGAAAT 



159 



(OA)12 



CTATACACATTGAGAAATGGCA 



168 



CT>ll(GTM 
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Table 2A (cont) 



G067O* ATCGAGTCCGACGTCAGGCT 



TTATTT ATTT ATTCATAG AG ATCC A 



98 



I(CT)13_ATG(A2T 
XA3TE(A3T) 



00*70? 



OOTOCATOOAOCCTO CI I CI 



TOCCAOTTCAGTTTTCAAAGTT 



147 



(CrH7fOTO 



O0*7J0 



TTCCl API 1 rCTATTCTOCTC 



AACCCGGGATTGAGTOCTO 



1*7 



00*713 



GAGATCGAGTCCCATGTCAO 



CTTTOAGGAOATAAATCTTTCTA 



225 



<TC)20 



00*714 



ATCAAATCCCACATCpppCTC 



ATTAOTTCAAACCTCOOCAATO 



G0*713 



TTGATCOAOTCCTACATOOO 



163 



TCTTOOGTAAACTACTTAACTT 



174 



ooia 



00*717 



TPCAOCCTGCI IC1C0CTCT 



CCTTATTCAOATTTAOCTGTTTO 



147 



fTCWTOP 



00*801 



AGOGACCTOCl ICICCI ICIO 



CAATGATTATGGTTTGTCAACTT 



00*805 



GACACCCAACCGCTGAGCAC 



OAOCCTGCTTCTCOCTCTOCC 



1*2 
1*8 



l(CAWOA>9 



00*901 



OGCAGCTTTOATOACTOATTTGA 



AGTCCTOTGTCAGOCTCCCT 



211 



I (OA) 17 



G06908 



OOAACACGTTAATTCATAAAAATGAT 



oocw 



GGCAGCACTAAACCACTGAO 



ATGGAGTCCCACGTCAOGCTAC 



205 



TGCTTTGCATCTTCCAUTI 



209 



{(OAPCATOAHO 



<OA)15 



G06910 



CTOTGCTCAGCOOOOAOTCT 



TATCTTAOAGTOATGGAOAGTGG 



101 



00*914 



GG AAGATGTTOTCTCTTATCA 



oooTAOOoon n gtttatoo 



159 



|(CA>20 



007001 



CTACATOOAOC CIG CI 1 CIC C 



TCOCCACAACnTATGTCCTC 



129 



fCDlUOTM 



GO7003 



CC1 1C1CCCTCTGOCTDTO 



GCCACTOATTTATTCTCTGTA 



197 



(CPU 



007004 



TOCTTGCTCTCTCTCAAATAA 



OTGCATGO AGCCTGC I IC1 



181 



fOA)10O(OAY» 



007005 



CCCTCTGCCTOTGTATGTCTC 



ATOOCAOCAGOGAGTAGTCCA 



134 



O0700* 



CAGTGGGOAATCTGCTTOAQ 



CATTTCACTAC AT AT ACA GGTGTC A 



150 



Kcnn — rcnio 



O07007 



AATACCTGGGTAAACATTTA 



GOGATCGAGTCCCATGTC 



156 



GO7008 



OTGCATGO AGCCT G C 1 IC1C 



AATGTACCTOTCCCU 1 1 1G 



127 



hEQ13_ 



GO7301 
G073O8 



GCATTCACCCAATAGTCCTTG 



GCTGCTTCTCCCTCTGCCTAC 



135 



f(OA)13 



TCTCAATTTOAAAAGTTTATAOTC 



rrCTCCCTCTCCCCCTATC 



174 



(TA3)9_(GA)5_ 
\(OA)l 



007310 



TATGCTTCTCCCTCTTCTCTO 



CGTTTCTCTCCI I G ATTTGT AAG 



59 



RCQU_ 



007312 



CTTCTCCCTCTGCCTOTGTC 



TGCTAAACTCAACTCTCCTAA 



123 



ccni4 



O073I4 



CCATCAGTTTGTTCTCTATCA 



OAAGCCTAAGTCAGGAGTAG 



224 



kcnu 



G07402 



OGAGeXTGCTTCTCCCTCTO 



TATCGTGCCACACTGCTOAAT 



244 



(CT>llT2<Cn5 



O0740* 



AATTTAGTCGAAQAATGAAAGATG 



G AAATAGCCTT AAAAGC AATOT A 



221 



(QA)14 



007407 



CCACCTCOQCTOCACTOAAOA 



TGGAGCCTGCTTCTCCCTCTO 



134 



l(OA)I0 



007408 



TOTCACTCTOCTCTCCACCTO 



AGTGCCTAAAGTTCTTCCT ATTG 



135 



EEQ!±. 



007410 



A T C T C C1 ICIOCACTCCTOCT 



CACGTAAGGGATOAOTTCAOCT 



147 



f(TC)8<TOtt 



007413 



CTGOAACAOAACCCACAATA 



ACG AG ATCAGTCC C AC ATCAG 



231 



(OA)24 



G07414 



TCCCTG AAA GGGGC ATTTAAG AC C 



AGCCTGCTTCTCCCTCTGCCTATG 



128 



G07420 



TC AGG AGGTG AGTTGCTTGG AG 



CGGTGC ATGG AGCCT GCTTCT 



1*2 



|fCA)3(OA)l6 



G07502 



CTCCCTCTGCCTATGTCTCTO 



ACAOCCCTOTTTACCG AGGTG 



255 



RCQH 



G07503 



CAGGAAACTGCTGGACTTGTOCT 



OC 1 1 C I CCCTCTGCCTGTOT 



126 



1(GA)15 



G07504 



AGTTCTGOAOOCTGGOAAGTC 



GGTGTGAAATOGCTCTTTAGATA 



215 



(Cf)23 



G07505 



TGCATGOAGCCTGCTTCTC 



AGCAGOTTACTCTTAGTGACTCC 



138 



kcpn 



GO750* 



ACTTCTCCCTCTGCC IG ' l G T 



TCCAOTGTATOTTGATTGAA 



124 



007507 



ATGGAGCCTGCTTCTCCCTCT 



OTTTCCTGCTCTCCTACCTGG 



163 



G07508 



AGCCCIUCI MCTCCCCTCT 



OATTTTGATTTACATTCACAAOTACA 



98 



qo>io 



GO7510 



AGGCATCCCTTACTT ACTTACTTG 



TCCCACATCAGGCTTGCTCTAT 



152 



007701 



TATTCAAGCCATTGACGCATTO 



CATGGAGCCTGCTTCTCCCTC 



247 



(TC)2CrC)2GCCCr| 

Iqi* 



007703 



CTGCTTCTCCCTCTGCCTATG 



TTTCCAACATTATOCTATGAT 



198 



007704 



AGCCTGCCTCTCCCTCTCCA 



AOAOTCACAATGCAACCCCACAA 



24* 



l(TO24 



G0770* 



GOTOACACTATACTGAACCTTCT 



007707 



CTCCCTCTGCCTOTOTCTCTO 



TCrTTCTCCCTCTCCCTCTGA 



AATTTTTATGTGTCCTGGTTCAOCC 



1* 



202 



U2H1L 



1<CT)9 



007709 



CATTTCGCTCATOTGCCTOACTOA 



CATGGAGCCTGCTTCTCCCTCTCC 



147 



|(QA)I6 



G07710 



GCTTCTCCCTCTOCCTCTATCTCT 



ATTOATCCCGGATTTTGGTAATA 



175 



Fry 



G07711 



TA G1 ICI 1 IC TOCCC1 1 U1C C 



CATTTCCAATCCATTTAOAOA 



149 



007712 



CTGCATGGAGCCTGCTTCTC 



TCAOACGCTCAACCAACTGAO 



179 



UCT)9 



007713 



CTTGAAGOCOGCTGTTCTTO 



CAGCATGGAGTCTO C 1 1 U I C 



TTGGACTTTCTCTCCCTCTCCT 



234 



GGGTAGAACTOACATTCTTT 



AGCTAAACATTTAACCAACTOAG 



219 



}(OA)l* 



1008002 



G08005 



I CGTATGGTCCTOGAGACCTO 



CTGTAGGGAGCCTGCTTCTC 



133 



(CT)14 



CT AATTGAGGAG ATAGG ATACATAAT 



153 



(OA)7CA(OA.8 



(CDI7 



< OTCAGCTTAGCCATTG AAOAAT 



t GTCAGCTTAGCC ATTG AAGAAT 



CCTGC1 IC1CCCTCTOOCTC 



IGGCACAACA CTCTG AATT ATT AG 



CCTGCI i C I CCCTCTGCCTC 



174 



GGTCTTCACTGCAAGGOAACT 



CACTCATrr ATTOCCCTA C 111 i A 



174 



(CA)2(OAH5 



175 



|(CAP(OA)13 



CATCAGATACTCCAACATTCAG 



190 



I (OA) 15 



(OA)20 



G08007 



CAGAGTATCCTTGCCTGTAO 



CTOCCTGG AGCCTG C! ICI 



139 



<CAV3(OA)12 



G09201 



togtactgtagctttgaagat 



tctgtgaaagacaccctattta 



173 



l(CT>14 
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Table 2A (cont) 



H0330I 


TTGCCTTCTCGOTGTATTGACTT 


GAATGTGGTTAGTAGAATTATACAG 


300 


fAT3)IO(AT2)2AT 


H03302 


GATCCTGATTGTTCTTOAG 


GGCATGGAGCATACTTCA 


155 


(AT3V* 


H06601 


TGCTTCTCCCTCTGCCTGT 


tggtgaaagattagcctotgga 


125 


(AT3)3 (AT4XA 

•np 




AAOTCCCACXOCAOOCTC 


ACGTCACCACAACCATCTAA 


165 


<AT3)12 




CATTTGCTG AGT CAAGGAATTCT 


AGTTAOCTGGAACTTGTCAGAA 


300 


<AT3)12 


K0S503 


1 GCA1 GGAUOCl UCl TCI 


C11C\ ACACATG TTGTOCCT 


160 


(AT6XAT4)2(AT3) 
13 




AGTCCAGC ATCACCO 1 1 IU1 


GAGGCTTATTTTCTGTCCAOTT 


144 


fAT3WAT4> 




1 CAOGC rCA 1 GGGaTTGAG acti c 


TGCCATTGC ACAGG ATATAGGTCC A 


305 


(A73)U 




TCCACACTCAGTGCAGAATCTGCTT 


TGTGAGACCGCAGAATACAGTACTC 


Ml 


(AT3)J1 
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Amplrfication reactions were carried out under standard PGR conditions 
descnbed above using the annealing temperature indicated for each locus or a 
touchdown PGR protoco. (Don. R.H. et al.. Nucleic Acids Res. 19:4008 (1991)) was 
established. The variably of these loci were evaluated using the dog panel For 
each locus. 5-10 dogs were studied in each breed. The number of alleles observed 
are presented in Tables 3A and 3B. 

Table 3A 
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Table 3B 



Marker 

I Locus 


Doberman 
Pinscher 


Siberian 
Husky 


Scottish 
Terrier 


English 
Pointer 


Greyhound 


D00101 


3 


2 


2 


3 


o 


D00401 


3 


6 


5 


5 


c 
O 


D01205 


2 


2 


1 


3 


O 


D01902 


5 


3 


4 


4 


/ 


D02001 


2 


4 


3 


o 


3 


D02005 


1 


3 


2 




3 


D0201 1 


2 


3 


4 


c 


2 


D02012 


3 


3 


4 




3 


D02202 


1 


3 


2 


2 


1 


D03709 


4 


6 


4 


5 


4 


D03805 


3 


7 


4 


5 


4 


D03908 


3 


8 


3 


4 


4 


D04403 


1 


3 


2 


3 


3 


j D04702 


2 


3 


2 


3 


2 



In general, all of the microsatellite loci tested displayed variability within and 
across breeds. While 9 cells out of 140 (6.4%) in Tables 3A and 3B were 
20 monomorphic, these were scattered though 6 different microsatellite loci, which were 
quite polymorphic in other breeds. The maximum number of alleles detectable by 
this analysis for a locus in a given breed was 8, in the case of locus D3908 in the 
Siberian Husky. The percent heterozygosity observed at each locus in each breed 
is presented in Tables 4A and 4B. 
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Table 4A 



5 



10 



15 



Marlrar 

Locus 


Mixed 
Breed 


Cocker 
Spaniel 


Labrador 
Retriever 


German 

Shcmhorrf 


Beagle 


D00101 


20 


0 


0 


n 
u 


90 


D00401 


100 


100 


100 




25 


DO 1205 


70 


50 


o 


ZZ 


64 


D01902 


100 


100 


100 


1 1 


36 


D02001 


40 


86 


57 


o0 


33 


D02005 


90 


29 


38 


22 


27 j 


D02011 


38 


0 


25 


A A 

44 


18 


D02012 


0 


17 




0 


33 


D02202 


20 


0 


0 


0 


0 


D03709 


20 


100 


75 


89 


50 


D03805 


100 


50 


50 


30 


67 


D03908 


100 


100 


100 


88 


100 


D04403 


100 


100 


100 


100 


100 


D04702 


22 


0 


80 


0 


30 
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Table 4B 



1 Marker 

I Awl 

1 Locus 


Lrooeiiiiafl 
Pinscher 


olberian 
Husky 


Scottish 
Terrier 


English 
Pointer 


Greyhound 


D00101 


60 


0 


78 


86 




D00401 


33 


50 


86 


67 


inn 


D01205 


60 


44 


0 


86 




D01902 


100 


63 


100 


100 


inn 


| D02001 


100 


57 


25 


50 


to 


D02005 


0 


50 


77 


71 


1UU 


D02011 


20 


33 


44 


43 


OU 


D02012 


0 


50 


17 


40 


A 

u 


D02202 


0 


0 


17 


17 


0 


D03709 


100 


78 


100 


86 


100 


D03805 


100 


67 


100 


80 


29 


003908 


33 


44 


100 


100 


100 


D04403 


30 


50 


56 


14 


29 


D04702 


67 


20 


33 


60 I 40 
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No heterozygotes were observed in only 21 out of 140 (15%) of the loci/breed 
combinations studied. At the same time. 30 out of 140 (21%) cells showed 100% 
heterozygosity. The mean and standard deviation of heterozygosity observed for 
each locus across different breeds, as well as the mean and standard deviation of 
5 heterozygosity observed within each breed across different loci are shown in Figures 
1A and 1B. respectively. The breeds studied show a mean heterozygosity ranging 
from 36 to 60% across different microsatellrte loci with considerable standard 
deviations. Among the loci studied D03908. D01902. D03709 and D00401 showed 
the highest mean heterozygosity across breeds of 87. 81. 80 and 75%. respectively. 
10 The number of repeats in the reference clone in these loci were 16.18.12 and 22. 
The least informative loci across breeds were D02202 and D02012 at 5 and 19% 
mean heterozygosity, respectively. The number of repeats in the reference clone 
in these loci are 12 and 15. respectively. Correlation analysis did not reveal any 
significant linear relationship between the number of repeats at a locus and its 
5 overall observed heterozygosity (r=0.22). 

Figures 2A-2D show the results from typical gels used to evaluate the alleles 
in gathering the data as described above. Amplification products of DNA from 
various different breeds at the locus D02011 are shown. Figures 2A-2D represent 
different gels, run under similar conditions. Note that the molecular weight marker 
0 identified in lanes marked M is the 246 bp band of the 123 bp ladder (Gibco-BRL, 
Gaithersburg. MD). The size of the amplification product in the reference clone was 
238. The different alleles are easily identified, with PCR products separating in 
sharp and well resolved bands, near and below the 246 bp marker. Some 
non-specific amplification products can be observed, especially in cases with higher 
template DNA concentrations; however, these do not interfere with correct typing. 

The results indicate that microsatellite loci containing CA repeats are 
abundant and highly polymorphic markers for the canine genome. These findings 
indicate that such markers hold great potential for use as linked markers for genetic 
defects in pure bred dogs. 

The estimate that there is one useful CA repeat every 31 kb in the canine 
genome is in good agreement with one every 42 kb estimated recently by others 
(Rothuzien. J. etal.. Theor. App. Genet. 89:403-406 (1994)). In the above-described 
study, a secondary screening was carried out and only very strong hybridization 
signals were accepted as positive, which resulted in elimination of about 20% of the 
primary positives. It thus appears that the estimate of the minimal CA microsatellites 



25 



30 



35 
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frequency in the canine genome is accurate. These estimates have practical 
implications particularly, since most cosmids have insert sizes in the 30^0 kb range, 
the likelihood of finding a useful CA repeat in a cosimd clone harboring a gene of 
interest is high. 

5 SPECIFIC EXAMPLE II 

Materials and Methods 

Patients and pedigrees. The patients and pedigrees used were primarily 
those used and described earlier (Yuzbasiyan-Gurkan. V. et al.. Genomics 1 5:86-90 
(1993)). Briefly, pedigrees of American Kennel Club registered Bedlington terriers 
10 were associated with the help of Bedlington terrier (BT) breeders. While all of the 
pedigrees have a family history of CT. not all had a symptomatic proband at the time 
of pedigree ascertainment. Diagnosis of dogs as to whether they were affected or 
unaffected with CT was made in all cases by quantitative copper assay from liver 
biopsies performed at 1 year of age or older by criteria earlier described. DNA was 
15 extracted from peripheral blood samples collected in acid-citrate-dextrose as 
anticoagulant as described (Yuzbasiyan-Gurkan. V. et al.. Genomics 15 86-90 
(1993)). 

Microsatellite analysis. The microsatellite markers used in this study were 
developed as described in Specific Example I. Standard conditions used to amplify 
each marker locus in polymerase chain reactions (PCR) were as follows: 25-50 ng 
of genomic DNA as template in 25 //I of PCR buffer (50 mM Tris HCI. pH 8.3 @ 
25-C. 50 mM KCI. 1.5 mM MgCI 2 ). 200 „M dNTPs, 200 P M with respect to each 
primer and 1.5 U of Taq DNA polymerase. A touchdown PCR protocol (Don. R H 
et al., Nucleic Acids Res. 19:4008 (1991)) was established to facilitate the robust 
amplification of most markers under the same conditions. PCR was earned out at 
94»C for 45 sec. 52»C for 30 sec. and 72-C for 1 min. 

The microsatellite markers were initially evaluated in ten sets of parents from 
the BT pedigrees. Those markers for which at least one parent was heterozygous 
were then evaluated in all the dogs in the pedigree. Seven to twelve microliters of 
product were run on a 5% to 7% Hydrolink D600 acrylamide horizontal gel according 
to the manufacturer's instructions with the following modification. During the 
overnight runs, a plexiglas gel carrier was placed on top of the gel to prevent the 
swelling and distortion that was otherwise observed. Initially, electrophoresis was 
carried out from 4 to 5 hr. at 50 V in 1 X TBE (90 mM Tris. pH 8.3. 90 mM boric 
acid. 2 mM EDTA) with ethidium bromide. A photograph was taken and the gel 



20 
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10 



15 



20 



25 



electrophoresis then continued overnight at 3JM0 volts depending on the fragment 
size of the product. A second photograph was taken and the results visually 
evaluated. It was found that two photographs were helpful in comparing different 
dogs with similar patterns. The alleles were then tabulated and used in linkage 
analysis. 

Linkage analysis. Two point LOD (logarithm of odds) scores between CTand 
all the markers tested were generated using the MLINK program of the LINKAGE 
package (v5.1) (Lathrop, G.M. et al.. PNAS (USA) 81:3443-3446 (1984)). A gene 
frequency of 0.5 was assumed for CT. 

Results 

Two hundred thirteen microsatellite markers were evaluated in the process 
of finding linkage. Of these 213 markers. 181 provided scorable products in BTs 
using the touchdown protocol described above. Of these. 114 were informative in 
the pedigrees and were further evaluated. 

Of all the markers tested for linkage to CT. only one yielded a significant LOD 
score. As shown in Table 5 below, marker number C04107 was found to be linked 
to the CT locus at a LOD score of 5.96. at a recombination fraction of zero. No 
recombinants were detected. Since a LOD score of 5.96 indicates that the odds of 
observing this linkage by chance is about 1 in a million, and since, a LOD score of 
greater than 3 or an odds ratio of 1 in 1000 is considered proof of linkage, the 
findings imply that the CT locus is indeed very close to the C04107 locus and thus 
can be used to predict the inheritance of alleles at the CT locus. No recombinants 
were detected in this study and thus a value can not be put on the genetic distance 
between these loci, except to say that they are very close. 

Table 5 





| 

e 

(Recombin- 
ation 
Fraction): 


0.0 


0.001 


0.01 


0.05 


0.15 


0.1 


0.2 


0.3 


30 


C04107 vs. 
CT 


5.96 


5.95 


5.85 


5.38 


4.78 


4.14 


3.49 


2.13 




C04107 vs. 
ESD 


-09 


-19.73 


-10.78 


-4.77 


-2.44 


-1.28 


-0.6 


-0.01 


35 


C04107 vs. 
RB1 


•OS 


-20.35 


-11.43 


-5.47 


-3.18 


-2.01 


-1.28 


-0.47 
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The primer sequence and allele information about this marker are shown in 
Table 6. The allele frequencies were determined from alleles observed in apparently 
unrelated dogs. 

Table 6 



Marker Locus 


C04107 


Repeat Motif in Reference Clone 


(CA) 6 CT(CA)„ 


Primer Pair 


TCAG CAACTATACATTTAAG AGGA 
CTGTCCCATCTAAAGGATAGG 


Allele 1 and Frequency 


163 bp, 0.39 


| Allele 2 and Frequency 


167 bp. 0.61 



10 



15 



20 



Marker C04107 was used to locate markers C04107B and C04107C shown 
in Table 2A. which are close to C04107 and also contain repeats. This "family" of 
markers may be used to detect CT. 

A typical pedigree illustrating linkage to C04107 is shown in Figure 3. In 
Figure 3, circles and squares depict females and males, respectively, and individuals 
affected with CT are indicated by the filled symbols. The asterisk in the figure 
indicates an individual not available for analysis. The bands are the negative image 
of amplification products obtained from the dogs indicated in the pedigree and 
analyzed individuals share the 2,2 genotype at this locus. In this pedigree, all dogs 
with the 1.1 genotype are predicted to be homozygous normal while those with the 
1.2 genotype are predicted to be heterozygous, and thus carriers of the CT gene. 

Given the finding of linkage and allowing for a small error for recombination, 
it is predicted that all the offspring with the 1, 1 genotype are clear of the CT gene 
i.e., homozygous normal, and that all 1, 2 offspring are carriers in this pedigree. 

Since data on the ESD and RB1 loci were available for most of the dogs from 
a previous study (Yuzbasiyan-Gurkan. V. et al., Genomics 15:86-90 (1993)), the 
linkage relationships of these loci with C04107 were was also evaluated. Neither 
ESD or RB1 were found to closely finked to C04107 (see Table 5). 

As demonstrated by the pedigree illustrated in Figure 3, given an informative 
mating, it is now possible to identify all the genotypes in the offspring, distinguishing 
30 between the homozygous normal, homozygous affected and heterozygous dogs 
provided the genotype of one affected dog is available. However, C04107 is not 
extremely polymorphic in the BT population, showing only two alleles and a 
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calculated heterozygosity of 0.43. Therefore, typing at the C04107 will not always 
yield information about the CT status of the offspring. Thus far. all affected dogs 
have been of the 2,2 genotype and the 2 allele is more common than the 1 allele 
(see Table 6). The matings which produce affected dogs will be found to be either 
i between parents who are both 2.2 both 1.2 or one 1.2 and the other 2,2. In such 
cases, typing at the C04107 locus will only be useful in the second and third mating 
types. In the latter mating pairs, predictive information would only be available as 
to which dogs are affected. In order to make most pedigrees in the breed 
informative, additional polymorphic markers closely linked toC04107 are developed.. 
It is predicted that a battery of three to five highly polymorphic markers will make 
almost every pedigree informative. 

If strong linkage disequilibrium occurs at C041 07 or nearby loci, the predictive 
power will be substantially improved. However, further studies of allele distributions 
in the BT population are needed to evaluate linkage disequilibrium, in any case, it 
should be possible to dramatically reduce the frequency of this serious disease 
within a very few generations. 

As discussed above, canine copper toxicosis is present in the West Highland 
White Terrier and perhaps in several other breeds. (Thornburg. LP. et al., Vet. 
Pathol. 27:81-88 (1990)). In the West Highland Terrier, it is clear that the phenotype 
is more complex, in that there is a spectrum of liver copper levels. This marker is 
evaluated in the West Highland White Terrier breed and it is determined whether 
there is segregation of high liver copper values with C04107. 

The foregoing discussion discloses and describes merely exemplary 
embodiments of the present invention. One skilled in the art will readily recognize 
from such discussion and from the accompanying claims and drawings, that various 
changes, modifications and variations can be made therein without departing from 
the spirit and scope of the invention. 

All publications referred to herein are expressly incorporated by reference. 
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WE CLAIM: 

1 A primer comprising a polynucleotide, wherein the polynucleotide has 
a sequence selected from the group consisting of the sequences of Table 2A. 

2. The primer of Claim 1 . wherein the sequence is the Sns sequence of 
5 marker locus C04107 of Table 2A. 

3. The primer of Claim 1 , wherein the sequence is the Asn sequence of 
marker locus C04107 of Table 2A. 

4. The primer of Claim 1 , wherein the sequence is the Sns sequence of 
the marker locus C04107B of Table 2A. 

10 5 The P rimer of c,a 'm 1. wherein the sequence is the Asn sequence of 

the marker locus C04107B of Table 2A. 

6. A method for amplifying DNA. comprising the step of performing PCR 
with the DNA and a primer set selected from the group consisting of the primer sets 
of Table 2A. 

15 7 - Tne method of Claim 6, wherein the primer set is that shown as the 

Sns sequence and Asn sequence of the marker locus C04107 of Table 2A. 

8. The method of Claim 6. wherein the primer set is that shown as the 
Sns sequence and Asn sequence of the marker locus C04107B of Table 2A. 
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